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Abstract 

The random map model is a deterministic dynamical system in a finite phase 
space with n points. The map that establishes the dynamics of the system is 
constructed by randomly choosing, for every point, another one as being its 
image. We derive here explicit formulas for the statistical distribution of the 
number of attractors in the system. As in related results, the number of oper- 
ations involved by our formulas increases exponentially with n; therefore, they 
are not directly applicable to study the behavior of systems where n is large. 
However, our formulas lend themselves to derive useful asymptotic expressions, 
as we show. 
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1. Introduction 



Since the 70's the random map model has attracted the attention of physi- 
cists t^'^l Indeed, in 1987 Derrida and Flyvbjerg have shown that the random 
map model is equivalent to the KaufFman model of cellular automata when the 
number of connections among the automata goes to infinity '^'^L This enlarged 
its application in the realm of theoretical biology, disordered systems, and cel- 
lular automata, for possible approaches of DNA replication, cell differentiation, 
and evolution theory 

On the other hand, one half of a century ago, some mathematicians had ap- 
proached the random map model in the context of random graphs. First, in 
1953, Metropolis & Ulam posed the problem of determining the number ©(n) 
of expected connected components (i.e., attractors) in random graphs with n 
nodes t^^; at the time, G(n) was estimated to be of order logn. Only one year 
later, Kruskal elegantly solved the problem obtaining an exact formula together 
with its corresponding asymptotic behavior : 

e(n) ^ - (In 2n + 7) + e, for n » 1, (1.2) 
2 

where e vanishes for n — > 00, and 7 is the Euler-Mascheroni constant. 

The statistical distribution of the number of connected components was ad- 
dressed both by Rubin & Sitgreaves t^^, and Folkert in his Ph.D. thesis under 
Leo Katz supervision t^l . Unfortunately, these two practically out of reach works 
were never published even though their relevance and an offer by Katz to do 
so t^L Later on, Harris partially reviewed and enlarged these results, proposing 
a new combinatorial expression for the distribution of connected components, as 
well as the complete solution of the simpler case in which the random map is 
one-to-one The mathematical expressions of a general random map found 
by Folkert and Harris, however correct, are based on constrained sums with a 
number of terms of order e"^, and involve Stirling numbers of the first kind t-'^-'^"-'^^]. 
Therefore, unfortunately, their straightforward use in physics or biology appears 
quite limited (for example, it is typical to deal with n ~ 2^^^ in models such as 
those of cellular automata). 

In spite of its importance, and as far as we know, a study of the variance for the 
distribution of connected components has not yet been undertaken. 

In this paper we propose a still new combinatorial formula — equivalent to the 
previous ones, of course — for the statistical distribution of the number of con- 
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nected components. As in earlier results, it also relies on a constrained sum, and 
the involved computational effort increases exponentially with n. Nevertheless, 
it has the advantage of allowing us the derivation of the long-needed asymptotic 
formula for the statistical distribution. Furthermore, we easily deduce from it 
asymptotic formulas for the corresponding average and variance. 

The paper is organized as follows. Section 2 is devoted to the main definitions 
of the random map model, and to settle our conventions. Then, in Section 3, we 
determine an exact combinatorial expression for the statistical distribution of the 
number of connected components in the model. The corresponding asymptotic 
formula for this distribution is derived in Section 4, as well as asymptotic formulas 
for the average and the variance for the number of connected components. To 
end, we present our conclusions in Section 5. 

2. The Random Map Model and Functional Graphs 

Let n = {1,2, ... ,n} be a set of n points. To each point in Q assign at random 
one point in Q with uniform probability distribution, thus defining a function 
/ : — > f2. In this way a dynamical system has been established on the so-called 
phase space ft through the iterations of /; this is the random map model 

Since Q is finite every orbit of / will eventually end in a periodic attractor, and 

several questions are in order, like: what is the expected number of attractors 
in the system? which is the statistical distribution of the number of attractors? 
how large is the dispersion of the distribution? Here, we answer these questions 
starting from combinatorial arguments. 

For each function f on Q define a functional graph whose nodes are precisely 
the elements of Q; moreover, if f{i) = j then draw a directed link from node i 
to node j, and whenever f{i) = i a loop on node i is drawn. As an example. 
Figure 1 shows a functional graph with three connected components (i.e., three 
attractors) in a set with n = 11. 

Note that each function f on Q (or functional graph) can be represented by an 
n X n binary matrix M = {Mij}, where Mij = 1 whenever f{i) = j and Mij = 
otherwise. Every row of matrix M has n — 1 zeros and one T. Clearly, there 
exist n"^ such matrices, and thus n" is the number of functions / that can be 
defined on f2, as well as the number of distinct functional graphs on n nodes. 
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[1] A functional Graph with three connected components for n= 11. 



3. The Distribution of Connected Components 



What is the number an of connected functional graphs (i.e., having precisely one 
connected component) that can be found among the n"^ functional graphs on n 
nodes? Through clever combinatorial arguments Katz obtained 

n— 1 

a„ = (n-l)!5^^, (3.1) 

fe=0 

and showed that, for n >> 1, this figure grows asymptotically like 

(l + o(i)). (3.2) 

Using (3.1) we derive now a new expression for the statistical distribution of the 
number of connected components in functional graphs. Consider first a partition 
V of fl, in k disjoint subsets Qi, . . . ,Qk, with ni, . . . , n/. points, respectively; then 

k 

Uj = n, with 1 < Uj < n, for j = 1, . . . , k. (3.3a) 
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Obviously, 11^=1 is the number of functional graphs G with k connected 
components, such that the subset Qj defines a connected component of G, for 
j — 1, . . . , k. Moreover, for given integers ni, . . . , n^, satisfying (3.3a), the multi- 
nomial coefficient „ ) = „ i^L i yields the number of distinct ways we 
can distribute n objects in distinguishable boxes Bi, . . . , Bk, of sizes ni, . . . , Uk, 
respectively I^^'^^l. This leads to 



k 



{ni,...,nk}' J = l 



as the number of functional graphs with k connected components, where {ni, . . . , Uk}' 
means that the sum is over all vectors (ni, . . . , n^) satisfying (3.3a), and the fac- 
tor stands to account for the unavoidable repetitions incurred by our above 
assumption of 'distinguishable boxes'. 

Therefore, for k = 1, . . . , n, the distribution for the number of connected compo- 
nents can be expressed as 

{ni,...,nfe}' J=l 

It is not difficult to see that the sum in (3.3b) has as much as (^Zj) terms a 
figure growing exponentially with n when /c ~ ^. This fact dooms to failure any 
numerical application of (3.3b) as it stands, and hinders further analytical work. 

The approaches of Folkert and Harris lead to increasing complications. The 
former yields ^^'^'^^ 

p,(fc) = -Ly^ y ( ^ nr...<^ (3.4a) 

ix=k {ni,...,n^}" ^ 

where S^!'^ are the Stirling numbers of the first kind [^^~^^] (see Appendix), and 
{ni, . . . , rin}" means that the sum is over all vectors ni, . . . , n^, constrained by 

rij — n with 1 < < n. (3.46) 
Harris managed to propose ^^^^ 



tJi=k {ni,...,n„}"' ^ ' ' " \ ^ ^ ' 

(3.5a) 
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in which {ni, . . . , n^}"' means that the sum is over all vectors ni, . . . , n„, con- 
strained by 

n n 

rij = n and ^^^jnj = n with < rij < n. {3.5b) 

3=1 3=1 

Expressions (3.4) and (3.5) appear more difficult to handle than (3.3) because of 
the added extra terms via the summation involving Stirling numbers. 

To our knowledge, the statistical moments of the distribution for the number of 
connected components have not yet been obtained in closed form. Further, the 
exact calculation done by Kruskal for the expected number of attractors (1.1) is 
not derived from a distribution (see Ref. [6]). For application purposes (n >> 1) 
it is then worthwhile to derive manageable asymptotic formulas for Pn{k); this 
is the subject of the next section. 

4. Asymptotic Expressions 

Let us start by defining 

/5m = 2e-^^^ (4.1) 
(m — 1)! 

and 

am = 1 - 13m- (4.2) 
Due to the asymptotic relation (3.2) it happens that 

= 1 + C , for m » 1, (4.3a) 

and 

a^ = 0\—\, form»l. (4.36) 
\m J 

Now, express (3.3b) in terms of (3m and use (3.3a) to get 

{ni,...,nfc}' J = l ■' 

Constraint (3.3a) can be broken by introducing a Kronecker delta inside the 
summation. Using the integral representation 
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where F is any closed contour in the complex plane of z such that the origin is 
inside, we go back to the expression for Pn{k) to obtain 

with a modicum of algebra, where 

9k{z) = [<fip (z)] with (fp (z) — _JI!L^rn p > n. (4.4a) 

m=l 

Since gk{z) is an analytic function we can apply the Cauchy integral theorem to 
find 

as an alternate way to compute Pn{k). 

The computation of (4.4) may be as difficult as that of (3.3); however, the former 
is manageable for n » 1, as we shall see. First note that (4.4b) does not 
depend on the particular value we give to p in (4.4a), as long as p > n, a feature 
that we shall use to our convenience. Taking p = n in (4.4a), gk{z) becomes 
a polynomial of degree nk, and its n-th derivative may be calculated in terms 
of finite differences with the help of Stirling numbers (see formulas (^.6) in 
Appendix), obtaining 

nk Q(n) 

(0) -n\J2 -^^"^dk (0) , (4.5a) 

where Sr^^ are the Stirling numbers of the first kind, and 

A^^fe(O) = J^(-l)^— f '■')<7fc(m). (4.56) 

m=l ^ ' 

Now we provide asymptotic approximations for (pp (z) in (4.4a) with p = n and 
n » 1. Consider first | 2; |> 1, and note that, due to (4.3), 

fc=i k=i ^ \ / / 

Now, the Euler-Maclaurin formula states t^^'^^l 

/ fix)dx+-{f{n)+f{l)) 
k=i ''^ 

00 jj 
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where / (x) is a function over the interval [l,n], and B). are the BernouUi 
numbers defined by the generating function 



e* - 1 ^ A;! ' ' 

fe=0 

with B2k+i = 0, for = 1, 2, . . . 

Let us take f{x) — ^ with | 2; |> 1, and substitute in (4.6a) obtaining 



/^^ ^ 



Eh>k 
k 

k=l 

Using (4.6b) yields 

k 



n 



2fe-l 



1 + 



n 



z — 1 n \ \n 



for 1 <| 2 |< e 



27r 



which can be continued analytically for any | ^ |> 1, thus obtaining 



{Z) = > Z 

m 



m=l 



Z Z 

z — 1 n 



Cn for \ z\> 1, 



where 



Instead, for ^ = 1, clearly 



Cn = l + 



(1) = — ~ (lnn + 7 -Tn)r7n, 



n 



m—l 



where 



nlnn 



m=l 



(4.66) 



(4.7a) 



(4.76) 



(4.7c) 



(4.7d) 



(4.7e) 



and 7 = 0.577 215 66 . . . is the Euler-Mascheroni constant. Note that, due to 
(4.3b), when n — > 00 in (4.7e), the limit r exists. Substituting back (4.7a,b) into 
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(4.4a), expanding then in Taylor series the term for | 2; |> 1, and cutting the 
series beyond order nk we come to 



r(/c + m) 



m=0 

with 

{k)m = k{k + l)...{k + m-l) 



m 

While substituting (4.7c-e) into (4.4a) gives us 

^fc(l) f«(lnn + 7 -T„)Sn- (4-86) 



Replace (4.8) into (4.5b) to get 

^^9k (0) ^ i-ir' r [g, (1) - A(f )] + ^-frJ^H,, (4.9) 



where we have used (^.4) to introduce the Stirling numbers of the second kind 
Tp^^\ and 

^.(A;) ^ {k)l 



The summation for \^n^ can be found in summation tables yielding ^^^^ 



nk 

E 

1=0 



{k)i [kin + 1)] 



l\ k\{nk)\ 



Next, applying Stirling's approximation (s! ~ V^irss^e * for s » 1) to the 
factorials for n » 1, one finds A ~ e'', while ^^(1) ^ e''^H^''n+f-rr,]+inrjr,}. 
thus, A is exponentially small as compared to 5^fc (1), and can be neglected in 
(4.9). Replacing back (4.9) into (4.5a), using properties (A. 5), and substituting 
in (4.4b), we find for the asymptotic distribution of the number of attractors 

Pn ik) - (1) E TprUlSt"^ (k) , (4.10) 

l=n '' 

where 
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We show now that Nn{k) is a null term, that is to say 

n 

lim y^k'^Mnik) = (4.12) 



n—KX 

k=l 



for any m > 0, and therefore it does not contribute to the averages taken with 
Pn{k), forn >> 1. First note that N'n (k) ~ for any k, so as in the summation 

(4.12) the terms with k ~ 0{1) do not contribute. Hence, we can make the 
approximation for n » 1 and k » 1 in (4.11), and calculate (4.12) with it, 
obtaining 



e-(i+2k) 1 



1 



^/n kl 
Since 

J2k^^=V^{x)e^, 



k 



k\ 

fe=l 



where Vmix) is a polynomial in x of degree m, we have 



and thus the expression goes to zero as which proves (4.12). 

Then, (4.10) can be expressed very simply in terms of Stirling numbers of the 
first kind by noting the following: let us take for a moment OLm = in (4.4a), 
and also set p = oo with | ^ |< 1 (so the infinite sum converges), obtaining 



oo ^ ^ ^ 



Qk 



(^)= E--"^ ={-ln(l-.)}\ 



m 

\m=l 



where the tilde indicates that we have set am = 0. Using {A.2) and substituting 
in (4.4b) we arrive to 



Comparing with (4.10), we obtain, up to the null term, that 

p^k) ^ ^ i-ir' Si!'^ {^)\ (4.13a) 
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where 



f^n = l-^^. (4.136) 
7 + mn 



Since pn {k) is a statistical distribution, 

n 

lim y^pn{k) = 1. 



Which allows us to compute r = lim^^oo '^'n in (4.13b), by using property (A.l) 
of Stirling numbers, giving 

T = ln2. (4.13c) 

We can replace back (4.13c) into (4.13a) and (4.13b), yielding and error of order 
O (1/lnn), for 1 < /c < n. So we have, for the asymptotic approximation of the 
distribution of the number of attractors, the following expression: 

p„W«f;(-ir'5«(|)'(l + 0(^)). (4.14.) 



where 



p=l-^^. (4.146) 
inn 



Now we can use (4.14) to calculate averages. Using expression {A.3), we have 

e(n) ^^(ln2n + 7 + o(j^)) (4.15) 

for the average number of attractors (n), which coincides, as it should be, with 
Kruskal asymptotic approximation (1.2). The variance can also be computed by 
using the derivative of (^1.3): 

.= «e(„)(l + o(^)), (4.16) 

which is a new result. 
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Conclusion 



We have proposed a new and simpler expression for the distribution of the number 
of attractors in the random map model (Eq. (3.3)). The number of operations 
involved for the numerical evaluation of the distribution grows exponentially 
with n, thus making expression (3.3) useless for direct calculations for n >> 1. 
To overcome this difficulty, however, we have derived an asymptotic formula 
(Eq. (4.14)), from which we directly deduced asymptotic values both for the 
average number of attractors and for its variance (Eqs. (4.15) and (4.16), resp.). 

In the random map model, additional statistical figures are of interest. Among 
them, the average attractor size, the average period (or average length) of the 
cycles; also, given a point x in the phase space, the expected length of its orbit, 
the expected length of the cycle in the attractor containing x, the expected 
number of points from which x can be attained, etc. Some of these figures have 
already been computed (for example, see Ref. [2,3,10]), and others are subject of 
recent studies ^^^K 
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Appendix: Properties of Stirling Numbers 



Here we list the equations and definitions on Stirling numbers necessary to follow 
the calculations in this article. For a more extensive treatment see Refs. [11-13]. 
Stirling numbers of the first kind are generated by the functions 

n 

z{z + l)...{z + n-l)=Y^ (-l)"-"^5(^)z"^ {A.l) 

m=l 

and 

CO (,{k) 

{\n{l + z)f = k\Y,^z'' for \z\<l. (A2) 



r=k 



Applying the operator z4z to equation (^.1) we get the important relation 



' dz 

n 



z{z + l)...{z + n-l)J2 = E (-1)""" S^r^mz-. {A.3) 



z + m _ 

m=l m=l 



Stirling numbers of the first kind may be expressed in closed form by 

k—n 



1=0 

where 



\k-n + l)\k-n-in 



■ k=Q ^ ^ 

are the Stirling numbers of the second kind. It is known that Stirling numbers 
verify the relations 

S^^^ =0 if A; < n, (A. 5a) 

T^^^ = if j9 < {AM) 

and 

n n 



k=m k=m 



By means of Stirling numbers of the first kind it is possible to express derivatives 
of a function in terms of finite differences by the formula 

jm °° c("^) 



dz"^ ' ' ^ k\ 

k=m 
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if the summation is convergent, and where A/ (x) = f (x + 1) — f (x) and 

AV(;^) = E(-1)'-'Q/(^ + 0- (AM) 
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